Performance Improvement of Data Mining in Weka through GPU Acceleration
نویسندگان
چکیده
Data mining tools may be computationally demanding, so there is an increasing interest on parallel computing strategies to improve their performance. The popularization of Graphics Processing Units (GPUs) increased the computing power of current desktop computers, but desktop-based data mining tools do not usually take full advantage of these architectures. This paper exploits an approach to improve the performance of Weka, a popular data mining tool, through parallelization on GPU-accelerated machines. From the profiling of Weka object-oriented code, we chose to parallelize a matrix multiplication method using state-of-the-art tools. The implementation was merged into Weka so that we could analyze the impact of parallel execution on its performance. The results show a significant speedup on the target parallel architectures, compared to the original, sequential Weka code. c © 2014 The Authors. Published by Elsevier B.V. Selection and peer-review under responsibility of Elhadi M. Shakshuki.
منابع مشابه
Guest editorial: new developments in future networked systems
This special issue is based on the best papers from the 5th International Conference on Emerging Ubiquitous Systems and Pervasive Networks (EUSPN-2014), which was held in Halifax, Canada, on 22–25 September 2014. The conference attracted a large number of scientific papers that contributed to the state-of-the-art in the ground-breaking invention to future communication technologies, including m...
متن کاملAn Analysis of Urban and Rural Students Programming Skills Performance Using Clustering Techniques
37 Abstract— Educational Data mining is a recent trend where data mining methods are experimented for the improvement of student performance in academics. The work describes the mining of higher education urban and rural students’ related attributes such as improvement, behavior, attitude and relationship. The data were collected from Dr.Ambedkar Government Arts College in terms of the mentione...
متن کاملApplication of Kansei engineering and data mining in the Thai ceramic manufacturing
Ceramic is one of the highly competitive products in Thailand. Many Thai ceramic companies are attempting to know the customer needs and perceptions for making favorite products. To know customer needs is the target of designers and to develop a product that must satisfy customers. This research is applied Kansei Engineering (KE) and Data Mining (DM) into the customer driven product design proc...
متن کاملDeveloping innovative applications in agriculture using data mining
The WEKA (Waikato Environment for Knowledge Analysis) system provides a comprehensive suite of facilities for applying data mining techniques to large data sets. This paper discusses a process model for analyzing data, and describes the support that WEKA provides for this model. The domain model ‘learned’ by the data mining algorithm can then be readily incorporated into a software application....
متن کاملHigh Performance Implementation of Fuzzy C-Means and Watershed Algorithms for MRI Segmentation
Image segmentation is one of the most common steps in digital image processing. The area many image segmentation algorithms (e.g., thresholding, edge detection, and region growing) employed for classifying a digital image into different segments. In this connection, finding a suitable algorithm for medical image segmentation is a challenging task due to mainly the noise, low contrast, and steep...
متن کامل